AITopics | adaptation network

33a854e247155d590883b93bca53848a-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 10:12:38 GMT

artificial intelligence, dataset, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Neural Information Processing SystemsApr-25-2026, 10:12:34 GMT

Large pretrained language models (LMs) like BERT have improved performance in many disparate natural language processing (NLP) tasks. However, fine tuning such models requires a large number of training examples for each target task. Simultaneously, many realistic NLP problems are "few shot", without a sufficiently large training set. In this work, we propose a novel conditional neural process-based approach for few-shot text classification that learns to transfer from other diverse tasks with rich annotation. Our key idea is to represent each task using gradient information from a base model and to train an adaptation network that modulates a text classifier conditioned on the task representation. While previous task-aware few-shot learners represent tasks by input encoding, our novel task representation is more powerful, as the gradient captures input-output relationships of a task. Experimental results show that our approach outperforms traditional fine-tuning, sequential transfer learning, and state-of-the-art meta learning approaches on a collection of diverse few-shot tasks. We further conducted analysis and ablations to justify our design choices.

machine learning, natural language, text classification, (17 more...)

Neural Information Processing Systems

Country:

Europe (0.67)
North America > Canada (0.46)
North America > United States (0.46)

Genre: Research Report (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

f3da4165893c2465fd7e8df453c41ffa-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 23:03:04 GMT

artificial intelligence, deep learning, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)

Add feedback

1138d90ef0a0848a542e57d1595f58ea-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 12:25:34 GMT

dataset, feature extractor, learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)
North America > United States > Minnesota (0.04)
North America > Canada (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

33a854e247155d590883b93bca53848a-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 04:34:38 GMT

base model, dataset, representation, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.66)

Add feedback

f3da4165893c2465fd7e8df453c41ffa-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 11:46:54 GMT

artificial intelligence, deep learning, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)

Add feedback

1138d90ef0a0848a542e57d1595f58ea-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 03:21:47 GMT

artificial intelligence, learning, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America (0.28)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

A Unified Analysis of Generalization and Sample Complexity for Semi-Supervised Domain Adaptation

Vural, Elif, Karaca, Huseyin

arXiv.org Machine LearningJul-31-2025

Domain adaptation seeks to leverage the abundant label information in a source domain to improve classification performance in a target domain with limited labels. While the field has seen extensive methodological development, its theoretical foundations remain relatively underexplored. Most existing theoretical analyses focus on simplified settings where the source and target domains share the same input space and relate target-domain performance to measures of domain discrepancy. Although insightful, these analyses may not fully capture the behavior of modern approaches that align domains into a shared space via feature transformations. In this paper, we present a comprehensive theoretical study of domain adaptation algorithms based on domain alignment. We consider the joint learning of domain-aligning feature transformations and a shared classifier in a semi-supervised setting. We first derive generalization bounds in a broad setting, in terms of covering numbers of the relevant function classes. We then extend our analysis to characterize the sample complexity of domain-adaptive neural networks employing maximum mean discrepancy (MMD) or adversarial objectives. Our results rely on a rigorous analysis of the covering numbers of these architectures. We show that, for both MMD-based and adversarial models, the sample complexity admits an upper bound that scales quadratically with network depth and width. Furthermore, our analysis suggests that in semi-supervised settings, robustness to limited labeled target data can be achieved by scaling the target loss proportionally to the square root of the number of labeled target samples. Experimental evaluation in both shallow and deep settings lends support to our theoretical findings.

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Machine Learning

2507.22632

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Asia > Middle East > Jordan (0.04)
Asia > Middle East > Republic of Türkiye > Ankara Province > Ankara (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

From Electrode to Global Brain: Integrating Multi- and Cross-Scale Brain Connections and Interactions Under Cross-Subject and Within-Subject Scenarios

Zhige, Chen, Chengxuan, Qin

arXiv.org Artificial IntelligenceNov-7-2024

According to the study of brain connectomics [29] and the aforementioned statement above, the topological connection Spurred on by the advent of advanced non-invasive techniques of the human brain takes place on three separate levels with such as electroencephalogram (EEG), explorations of different scales, inextricably linked with the geometry of the brain networks have entered a new era [40]. The proposed multi-scale spatial data distribution as a remarkable organ, exhibits a high level of time-varying differences can thus be concluded as three categories under complexity attributed to the intricate nature of the structural different brain scales: connections among its constituent units [4]. To the best of the authors' knowledge, The deep domain adaptation (DDA) method combines the no previous work has integrated the multi-scale spatial data superiority of deep learning and transfer learning, becoming distribution problem with the deep domain adaptation network one of the most efficient tools to address the data distribution (DDAN), neither on the design of the CNN structure nor difference problem in cross-subject EEG classification tasks the establishment of the adaptation domain. More and more researchers utilize this powerful integrate the principles of multi-scale brain topological structures tool to solve cross-subject motor imagery (MI) classification in order to solve the multi-scale spatial data distribution problems [35], [37], [38], aiming to improve the model generalization difference problem [29], a novel multi-scale spatial domain and the classification performance by transferring adaptation network (MSSDAN) consists of both multi-scale knowledge from source domain subject. The existing three types of crosssubject A. Overview of MSSDAN MI classification (MTM: multi-source to multi-target, MTS: multi-source to single-target, and STS) DDA methods In this paper, we propose MSSDAN, a new domain adaptation focus more on the global [15], [39], [41], class [14], [20], and method for the brain-computer interface, which consists of temporal domain adaptations [2], [5].

artificial intelligence, classification, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2411.05862

Genre: Research Report (0.70)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Sim-to-Real Domain Adaptation for Deformation Classification

Sol, Joel, Fayyad, Jamil, Alijani, Shadi, Najjaran, Homayoun

arXiv.org Artificial IntelligenceJul-13-2024

Deformation detection is vital for enabling accurate assessment and prediction of structural changes in materials, ensuring timely and effective interventions to maintain safety and integrity. Automating deformation detection through computer vision is crucial for efficient monitoring, but it faces significant challenges in creating a comprehensive dataset of both deformed and non-deformed objects, which can be difficult to obtain in many scenarios. In this paper, we introduce a novel framework for generating controlled synthetic data that simulates deformed objects. This approach allows for the realistic modeling of object deformations under various conditions. Our framework integrates an intelligent adapter network that facilitates sim-to-real domain adaptation, enhancing classification results without requiring real data from deformed objects. We conduct experiments on domain adaptation and classification tasks and demonstrate that our framework improves sim-to-real classification results compared to simulation baseline.

adaptation, dataset, deformation, (15 more...)

arXiv.org Artificial Intelligence

2407.10011

Country: North America > Canada > British Columbia > Vancouver Island > Capital Regional District > Victoria (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback